Exploring Datatype Usage Space

نویسندگان

  • Graeme E Moss
  • Colin Runciman
چکیده

Quantifying the use of a data structure makes benchmarking data structures easier and more reliable. We explore diierent ways of quantifying datatype usage. We present a basic solution and examine three extensions to this solution. 1 Motivation Suppose we have a selection of data structures performing similar tasks. How do we compare their eeciency? Traditionally we might choose a few benchmarks, and run each benchmark with each data structure. This has two drawbacks. Firstly, it may be hard to nd or create appropriate benchmarks. Secondly, the eeciency of a data structure may depend heavily on how it is used, though it may be unclear how a benchmark uses the data structure. However, if we could quantify use accurately, and create a benchmark for any given use, we could avoid both problems as follows. Imagine the space of possible uses of a data structure. Map out this space using a well chosen set of coordinates. Choose points at regular intervals in this space. For each point, create a benchmark with this use. Run this benchmark with each competing data structure. We now have a table of eeciencies qualiied by a broad selection of uses. See Fig.1. Suppose now we have an application using any one of these data structures. By nding how this application uses the data structure, we can identify a point in the usage space. By examining the nearest point in our results table , we can estimate the performance of the competing data structures for this application|see Fig.1 again. This estimation will in general be more reliable than that obtained using hand-picked benchmarks. If the application changes how it uses the data structure|because we change the input data or the program itself|we can justify any change in data structure with knowledge about the change in use. We need to deene the use of a data structure: Section 2 describes a model capturing datatype usage. Section 3 deenes a basic set of coordinates for the datatype usage space. Section 4 examines principles for mapping out the data-type usage space. Section 5 presents three possible extensions to the basic set of coordinates. Section 6 refers to related work. Section 7 concludes. 2 Datatype Usage Graph We want to capture the datatype usage of an application. The application can use one of many competing data structures. To abstract away from the choice of

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extending DL Reasoning Support for the OWL Datatyping (or "Why Datatype Groups?")

The OWL [2] datatype formalism (or simply OWL datatyping) presents some new requirements for DL reasoning services, in terms of semantics (to allow the use of so-called 'un-supported' datatypes), expressive power (to support enumerated datatypes) and datatype construction mechanism (both datatypes and datatype expressions). On the other hand, OWL datatyping is expected to be extended to include...

متن کامل

Supporting RFID-based Item Tracking Applications in Oracle DBMS Using a Bitmap Datatype

Radio Frequency Identification (RFID) based item-level tracking holds the promise of revolutionizing supply-chain, retail store, and asset management applications. However, the high volume of data generated by item-level tracking poses challenges to the applications as well as to backend databases. This paper addresses the problem of efficiently modeling identifier collections occurring in RFID...

متن کامل

Sociological Investigation of the Usage of Cyber Space and its Relation with Social Isolation (Case Study: Science and Research Branch Students)

The development and improvement of electronic communication instruments and means in postmodern societies with fast intention has caused further multiplication in meanings and content of the communication itself. On the other hand the involvement of cyber space in our privacy has caused massive personality changes in people particularly the youth, In a way that individuals are separated from re...

متن کامل

Exploring the relationship between course structure and etext usage in blended and open online courses

We use a two-parameter family of bounded distribution functions (Kumaraswamy) to fit electronic textbook (etext) usage in 20 blended and online courses from Michigan State University, MIT, and edX. We observe clusters of courses in the parameter space that correlate with course structural features such as frequency of exams.

متن کامل

Exploring the Relationships between Spatial and Demographic Parameters and Urban Water Consumption in Esfahan Using Association Rule Mining

In recent years, Iran has faced serious water scarcity and excessive use of water resources. Therefore, exploring the pattern of urban water consumption and the relationships between geographic and demographic parameters and water usage is an important requirement for effective management of water resources. In this study, association rule mining has been used to analyze the data of municipal w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998